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Abstract It is possible to consider stochastic models of sequence evolution in 
phylogenetics in the context of a dynamical tensor description inspired from 
physics. Approaching the problem in this framework allows for the well devel- 
oped methods of mathematical physics to be exploited in the biological arena. 
We present the tensor description of the homogeneous continuous time Markov 
chain model of phylogenetics with branching events generated by dynamical op- 
erations. Standard results from phylogenetics are shown to be derivable from the 
tensor framework. We summarize a powerful approach to entanglement measures 
in quantum physics and present its relevance to phylogenetic analysis. Entangle- 
ment measures are found to give distance measures that are equivalent to, and 
expand upon, those already known in phylogenetics. In particular we make the 
connection between the group invariant functions of phylogenetic data and phy- 
logenetic distance functions. We introduce a new distance measure valid for three 
taxa based on the group invariant function known in physics as the "tangle". All 
work is presented for the homogeneous continuous time Markov chain model with 
arbitrary rate matrices. 



1. Introduction 



Stochastic methods which model character distributions in aligned gene sequences 
are part of the standard armoury of phylogenetic analysis |28l(il7l21ll(i| . The evo- 
lutionary relationships are usually represented as a bifurcating tree directed in 
time. It is remarkable that there is a strong conceptual and mathematical analogy 
between the construction of phylogenetic trees using stochastic methods, and the 
process of scattering in particle physics 11 . It is the purpose of the present work 
to show that there is much potential in taking an algebraic, group theoretical ap- 
proach to the problem where the inherent symmetries of the system can be fully 
appreciated and utilized. 
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Entanglement is of considerable interest in physics and there has bee n much 
effort to elucidate the nature of this most curious of physical phenomena [251131 
12141!)) . Entanglement has its origin in the manner in which the state probabil- 
ities of a quantum mechanical system must be constructed from the individual 
state probabilities of its various subsystems. Whenever there are global conserved 
quantities, such as spin, it is the case that there exist entangled states where the 
choice of measurement of one subsystem can affect the measurement outcome of 
another subsystem no matter how spatially separated the two subsystems are. 
This curious physical property is represented mathematically by nonseparable 
tensor states. Remarkably, if the pattern frequences of phylogenetic analysis are 
interpreted in a tensor framework it is possible to show that the branching pro- 
cess itself introduces entanglement into the state. This is a mathematical curiosity 
that can be studied using methods from quantum physics. This is a novel way of 
approaching phylogenetic analysis which has not been explored before. 

In section [2| we begin by considering the stochastic model of sequence evolu- 
tion in phylogenetics using continuous time Markov chains (CTMCs)^. We go on 
to present this model in a dynamical tensor description where the probability dis- 
tribution is given by the components of a tensor in a preferred basis and the time 
evolution is generated by linear operators acting on the space. The phylogenetic 
branching process is then developed formally in sectional by introducing a linear 
operator which introduces an extra product in the tensor space. This operator is 
shown to be unique given that the probability distribution must be conditionally 
independent from branch to branch. We also show that the branching process 
introduces entanglement into the state space. The stationary states of the system 
and the pulley principle, which describes the unrootedness of phylogenetic trees, 
are presented in the tensor framework in sections and respectively. Section El 
is a short review of current methods of analysing entanglement in terms of group 
orbits and invariant functions. In sections and |S1 the work specializes to the 
cases of two phylogenetic characters and small numbers of taxa making up the 
analysis. In sectional the group orbits and invariant functions for the case of two 
taxa are presented and explicitly solved to show that the invariant function is the 
well known logdet distance. In sectionEDwe go on to study the case of three taxa 
where the invariant function known in physics as the tangle is shown to give a new 
distance measure for phylogenetics. This previously unstudied distance measure 
is found to be useful analytical tool in the reconstruction of phylogenetic trees, 
fsection llljl . 



2. Tensor methods in phylogenetic branching 



We consider a system consisting of A*' sites each of which takes on one of K distinct 
characters. Associated with such a system we have the set of frequencies 

^ total number of occurrences of character i 
■ = N ' 



^ The reader should note that the model considered in this paper is the general 
Markov model on a tree together with the additional assumption that the tran- 
sition matrices are non-singular and arise as an analytical continuation from the 
identity matrix. For a recent review of the hierarchy of phylogenetic models see 

u 
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We model these frequencies by defining a set of probabilities which are the theo- 
retical limit 

Pi — lim Pi. 

iV-^oo 

Introducing the K dimensional vector space V with preferred basis {e^}, we can 
associate the set of probabilities with a unique vector 

p = poeo +piei + ... + pK-iex-i- 

The probabilities are assumed to evolve in time as a homogeneous CTMC |1()I2()| . 
This amounts to assuming that the character state at any time t, conditional on 
the character state anytime t' < is independent of the character state at any 
earlier time t" < t' |6I7I21| . The defining relation for the time evolution is 

±p(t)^R-p{t), (1) 

where i? is a linear operator. To preserve reality of the probabilities and the 
property ~ l,Vt it follows that in the preferred basis i? is a real valued 

zero column sum matrix. A formal solution of Q for time independent R is found 
by exponentiating 

p{t) =e«*p(0) 

:=M(t)p(0). ^ ' 

We refer to M(t) as a Markov operator. Taking its derivative 

^M(t) = RM{t), 

we observe that 

As is well known, in order to conserve positivity of the probabilities it must also 
be the case that 

Rrj > 0, Vi / i; 

R^^ < 0. 



In phylogenetics we consider the case where we have multiple, aligned, site, 
K character systems labelled by {1,2, ...,L}. We refer to the individual systems 
as taxa. What is now of interest is the set of frequencies 

^ _ total number of occurrences of pattern 1112. ..il 

Pili2---iL JY I 

ii, 12, iL =0, K ~1. 

We model these frequencies by again defining a set of probabilities which are the 
theoretical limit 

P»l»2 - »i .Jil^ Piii2.-.iL- 

The system is assumed to have evolved in time as a homogeneous CTMC. 

Introducing the random variables xi,X2, xl each of which take on values in 
the individual character spaces {ii , 22, il = 0, isT — 1} and x = {xi,X2, ..■.,xl) 
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which takes on values in the dimensional character space {iii2--iz,} we can 
write the transition probabilities of the Markov chain as 

V{x = ili2---iL,t\x = jlj2--.jL,0) 

= F{xi=il,t\xi =jl,0)F{x2 = i2,t\X2 =j2,0)-.-fiXL=iL,t\XL =jL,0) 

From this we notice that it is possible to construct the state space setting of the 
tensor product V (S> 1^(S> ••■ ® V = y®^ where the probabilities are associated with 
the tensor 

= J2 Pn<2-ii,(*)eii ®ei2 ig)... igiei^. 

il,---,iL 

Time evolution of this system is generated by the transition probabilities of the 
Markov chain which in tensor notation can be represented as linear operators 
acting on the initial pattern distribution 

P{t) = M\t) ® M'\t) ® ... ® M^(t)P(0), 

where distinct rate parameters have been allowed on each component of the tensor 
product space. (The reader should note that M'' refers to the Ith component of the 
tensor product space and is not meant to indicate the Ith power of the operator 
M.) 

If we have a phylogenetic tensor P{t) which describes the pattern distribution 
of characters for L taxa, it is possible to find the reduced tensor P{t) which gives 
the pattern distribution for a subset of I taken from the original set of L taxa. 
The correct operation is given by 

Piii2-iLii) ^Pni2...i,ii) = J2 Piii2-iLit), 

P{t)= Pn<2...nWen®ei, ig)...ig)ei,, 

all i's 

where s,i, label the taxa which are not in the subset. Such a reduced tensor 
will be referred to as a marginal distribution. (In this work we will follow the 
convention from here on that if a summation sign has no suffixes it is assumed 
that all indices inside the expression are to be summed over.) 



3. Phylogenetic Branching 



Having developed the general tensor description of the homogeneous CTMC model 
of sequence evolution in phylogenetics, in this section we will now introduce a 
formalism for describing the branching events. We do this by defining a formal 
operation on the tensor space. 

Consider the case where we have a single taxon branching into L = 2 taxa. The 
corresponding mathematical operation is F ^ V(S)V . If the branching event occurs 
&it = T we are required to determine the appropriate pattern probabilities Pi^i^ (t) 
given the probabilities Piij). (In this paper t is considered to be an additional 
model parameter alongside the parameters in the rate matrices). Intuitively a 
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reasonable choice is the initial set Pii{r) = Pi(r) and Pijir) = 0, Vi / j. 

In order to formalize this we introduce the splitting operator S:V^Vi3V. 
The most general action of S on the basis elements of V can be expressed as 

where -T/'" are an arbitrary set of coefficients. Standard models of phylogenetics 
assume conditional independence upon the distinct branches of the tree |22l(il7l 
This assumption will presently be used to determine the exact form of the 
splitting operator. It is only necessary to consider initial probabilities of the form 

7 = 0,l,...,if-l 
so that the initial single taxon state is 

i 
i 

Directly subsequent to the branching event the 2 taxa state is given by 

We implement the conditional independence upon the branches by setting 

f{x — iii2,t = t'\xi — X2 = ■y,t — T) 

— P(xi — ii,t — t'\xi—'y,t — t)V{x2 = i2,t = t'\x2—'y,t = T). 

Using the tensor formalism the transitions probabilites can be expressed as 

r{xi ^ ii,t = t'\xi = 'y,t = t) Ml^^:{t' ^ t)SI 



{x2 = i2,t = t'\x2 = 7,t = r) = ^ Mi^i(t' - t)5 



(5) 



P(a; = 4112, t = = a;2 = 7, t = r) 

k,l,m 

Implementing lO leads to the requirement that 

A'' = S-S- (6) 
and the basis dependent definition of the splitting operator 

5 ■ Ei = d. (7) 
The action on the components of a vector is such that 

P^J{r) ^ P^{T), i=j- 

= 0, i / j; 
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which is consistent with our intuitive guess. As will become apparent, the opera- 
tion defined in ^ takes disentangled states into entangled states. 

The splitting operator is an important structural element of the tensor de- 
scription and its symmetry properties [T| are intimately related to the existence 
of discrete transform methods for particular classes of phylogenetic model. More 
general forms of the conditions (01 can be envisaged under weaker assumptions 
than considered here. Finally, in the particle scattering picture for phylogenetic 
branching ITIJ the splitting operator is implemented as an interaction term. For 
present purposes the utility of S is that it allows us to write down a formal expres- 
sion for a system which undergoes a branching event. Suppose a system described 
by the tensor P € \/®^ undergoes a branching event on its r*** branch, the new 
system is described by 

P (ll ® I2 ® ... ® Ir-l ® <5 ® lr + 1 ® ... Il)P e V^^ + \ (8) 

where Is is the identity operator on the sth component of the tensor product 
space. We introduce the convention that the tensor space is labelled so that under 
the action (O the probabilities are given by 

Pn*2---»I, Pili2---»r!r + l»,-+2---»j:,»t + l ~ Pi 1 12 ■ ■ • V V + 2 ■ • • «!, » t + 1 '^V V + 1 ' 

We introduce parameter sets labelled on the edges of the tree by {ea, a = 1, 2, ...} 
and defined as ea = {oa, /3a, ia} to distinguish entries in the rate matrices 
and branch lengths on different edges. Given the solution ^ it should be noted 
that there is an edge scaling symmetry 

{aa,l3a, ■■■} ^ {Acta, A/3a , ..■}, 
ia ^ A ta 

which leaves the model invariant. This symmetry is well known in the literature 
|(il7) and indicates that it is not possible to distinguish between a fast rate of evo- 
lution and a long time period of evolution. When the rate parameters {aa, /3a, ...} 
are identical on all edges of the tree, a "molecular clock" is said to be in opera- 
tion. Under the circumstances of a molecular clock it is possible in principle to 
determine the time period. 

The edges of a tree are labelled using an away-from-the-root and left-to-right 
ordering convention. As an example the expression which defines the most general 
homogeneous CTMC on the tree (1((23)4)) is given by 

Pt2T3 (t) 

where p is the initial single taxon distribution, T2,T3 define the branching times 
and ti — t, t4 = t — T2, ts = te — t — T2 — T3, (see figure^. 

In this work we will derive results which are independent of the particular rate 
parameters which occur in the Markov operators of the model. 



4. Stationary states 



A stationary state of a homogeneous CTMC is defined as the vector, tt, which 
satisfies 

R-TT = 0. (9) 
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Figure 1. The CTMC model of four taxa on the tree (1((23)4)). 



The stationary state can be generalized to the case of a tensor, U £ y'^^ ^ associ- 
ated with a set of pattern probabilities which satisfies 

Ri® R2® Rl ■ n 

and has solution 

i7 = TTl (g) 7r2 ® ... ® TTL. 

It should be noted that 77 is a completely separable state and that any state tends 
to the stationary state as t ^ 00 |lf)| . 



5. The pulley principle 



We consider the case of a single taxon which is in state p £ V at time t = 0. We 
implement a branching event aX t — Q and let the system evolve under arbitrary 
Markov operators to produce the state at a later time t 

P{t) = M,^®M,^5-p eV®V, (10) 

(see figure I2I1. 




The dual vector space V* has a basis which consists of the linear maps {/' : 
V ^ F^i = 1, K} which satisfy /'(ej) = Sj. Using this dual basis we can define 
an isomorphism (j> : V (^V L{V) as (j>{ei ® ej) Cif^ and rewrite HlOfl as 



0(P(t)) = M,,<^(5-p)M7^, 
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where j indicates matrix transpostion. 

The pulley principle is then a direct consequence of the existence of states p 
such that M(j>{S ■ p) = (f>{5 ■ p)M'' for a given Markov operator M. A solution can 
be found to be 

P = TT, 

where M ■ -r — -k so that tt is a stationary state under M. Putting this together 
we can conclude that 

Pi(f) := [M,, (g) M.J (5 ■ TTi = [1 ® M.^M.J 5 ■ TTi, 

or 

P2{t) ■- [M,, 8 <5 ■ TTa = [M,,M,^ ® 1] 5 • vra, 

for the special case where Me^ -^1=^1 or ■ 7r2 = 7i"2, respectively. This tells 
us that in the case where the initial distribution is a stationary state of one of 
the rate matrices on edges 1 and/or 2 the placement of the root of the tree is 
not strictly determined. This property has been observed previously to hold when 
the Markov chain is reversible, BtZj. However in the tensor framework presented 
here we have refined the pulley principle by showing that one requires only that 
the initial distribution tt be a stationary distribution upon either branch of the 
tree for one to be able to "pull" that same branch through the initial distribution. 
This is a less stringent requirement than that of reversibility |lfl| . 



6. Group invariants and orbit classes 



In quantum physics there has been much interest in quantifying and/or classifying 
the phenomenon of entanglement between multiple non-local systems The 
correct description involves expressing the total state vector as belonging to the 
multi-linear space built from the tensor product of the individual state spaces. 
Entangled states exhibit non-local behaviour and correspond mathematically to 
the non-separable property of such state vectors. 

A systematic approach to the classification problem is to study the orbit classes 
of the tensor product space under a group action which is designed to preserve the 
essential non-local properties of entanglement. The orbit of an element h belonging 
to the (multi)-linear space H under the group action G is defined as the set of 
elements {h' £ H : h' = gh for some g € G}. 

In quantum physics the appropriate group action is known to be the set of 
SLOCC opera tors, (Stochastic Local Operations with Classical Communication) 
|13I4I9I17I15| . Mathematically SLOCC operators correspond to the ability to 
transform the individual parts of the tensor product space 77 = //i ® i/2 ® ■.• ® 
Hn with arbitrary invertible, linear operations. These operators are expressed by 
group elements of the form 

ff = 51 ® P2 ® ... (g fln, 

where n is the number of individual spaces making up the tensor product, and 
g^ e GL{H,). 

The task is to identify the orbit classes of a given tensor product space under 
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tlie general set of SLOCC operators. Powerful tools in this analysis are the meth- 
ods of classical invariant theory. If H is defined on a field F, the set of invariant 
functions /(G) is defined as 

/(G) = {/ : H -> F, s.t. f{gh) = [Aetg]''f(h), fcG0,l,2,..., VffeG,/iG/r}. 

Clearly such invariants are relatively constant up to the determinant upon each 
orbit class of H . The set of invariants can, after some trivial definitions, be given 
the structure of a (graded) ring and it can be shown that there exists (under the 
action of the general linear group at least) a finite set of elements which generate 
the full set on a given linear space. It can also be shown that the set of orbit classes 
of a given linear space can be completely classified given a full set of invariants 
on that space |18| . 

The motivation of the present work is the possibility that the study of orbit 
classes can be used to elucidate interesting results in phylogenetics. 



7 . K — 2 characters and qubits. 



From here on we specialize to the case where the set of characters consists ofK = 2 
elements {0, 1}. When we are dealing with a single taxon the phylogenetic state 
p mathematically corresponds to a vector belonging to R^. In quantum physics 
the corresponding two dimensional object is the "qubit" which in turn belongs 
to the vector space and if we take multiple qubits the correct state space is 

we showed previously the case of the phylogenetics 
of multiple taxa the corresponding state space is // = (g) ... Cg) R^. In the 
forgoing work we will be implicitly taking advantage of the fact that R C C. 



8. Canonical Forms 



We wish to construct the orbit classes of = ® under the group action 
GL{C^) X GL{C^). We have seen that for any state h £ H we can find an isomor- 
phic state (f>{h) G //(C^) which transforms under the group action as = 
gitpgl- Hence we can answer the orbit class problem by taking a canonical 2x2 
matrix X and considering the set of matrices M : M — AXB; A, B £ GL{C^). 



Theorem 1. The vector space V ® V where V = C'^ has three orbits under the 
group action GL{V) x GL{V). Under the isomorphism V (SiV ^ L{V) the orbits 

are characterized by the following canonical forms: (i) Null- orbit X ~ >' (''■''■) 

Separable-orbit Y = ? (m) Entangled-orbit Z — ^g 'j'^ • The separable and 

entangled- orbits can be distinguished by the determinant function. 



Proof, (i) The null-orbit has only one member, the null vector; which is of course 
unchanged by the group action, (ii) We are required to show that the set of 
2x2 matrices M = {S : S = AYB;A,B G GL(y)} is all matrices such that 
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det(5') = 0. We begin by taking 
ad — be = 0. Clearly the matrices 

also belong to A^. So without loss of generality we can take a ^ and it is an 
easy computation to show that 

so that A4 is the set of 2 x 2 matrices with vanishing determinant, (iii) Clearly 
any 2x2 matrix A'^ with non-zero determinant can be written as A'^ = AZB where 

A,BeGL{C^). a 



a general member oi M, S — ( " ^ ) with 



Corollary 1. The orbits o/H = ® under SL{C^) x SL{C^) are labelled by 
the determinant function det[(f)(h)]. 



For further discussion see [211314) . 



9. The concurrence 



We consider the case of L = 2 taxa derived from the branching of a single taxon 
at t = followed by arbitrary Markov evolution. The state is represented by a 
tensor in 77 = (g) and is expressed as 

Pit) ^ M,^®M,J -p, (11) 

where t\ — t2 — t, (see figure EJ. 

The most general rate matrix depends on 2 parameters and can be expressed 

as 



R = 



-Q (3 
a -13 



where a and P are real. A simple free parameter count in expression lllll yields, 
taking into account the scaling symmetry on edges, 1 free parameter due to each 
transition matrix and 1 free parameter due to the initial state p. Hence there are 
a total of 3 free parameters and given that the components of the = 4 dimen- 
sional P{t) are probabilites we conclude that all free parameters are accounted 
for. 

In quantum physics the tensors representing 2 qubits correspond in phyloge- 
netics to the case of L = 2 taxa with K = 2 characters. As we have shown in the 
previous section, there exist 2 nontrivial orbit classes which are completely distin- 
guished by the relative invariant known as the concurrence, C ; 7i = — * C. 
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Using the formalism we have developed we can express the concurrence of the 
state /i G 7i as 

C{h) ^ det[(l>{h)], (12) 

which satisfies 

C{h') ■. = C{gi®g2h) 
= Aet[gi(j>{h)gl] 
= det[5i]det[g2]C(/i), 

so the concurrence is truly a relative invariant. This can also be expressed explicitly 
as 

C{h) = ^ hijhkieiktji, (13) 

where e is the completely anti-symmetric tensor with eoi = 1. The two orbit 
classes correspond to the completely entangled Bell state and the completely dis- 
entangled, and hence separable, state. The entangled orbit is the set of states 
equivalent to the Bell state 

hbeii = 75(^0 ® eo -I- ei (g) ei), 

whereas the dis-entangled orbit is the set of states which take on the separable 
form 

h — u(>$v, 

where 'u,v £ . The concurrence vanishes if and only if the state belongs to the 
separable orbit class. This property can be used to distinguish the orbit classes. 

In phylogenetic analysis the concurrence can be used to establish the magni- 
tude of divergence between a pair of taxa derived from a single branching event. 
The case where there is no phylogenetic relation cannot be distinguished from the 
case of infinite divergence. When there has been infinite divergence we have 

lim P{t) = n — TVti ® 

t — »oo 

which is a separable state and hence has concurrence 

C(7r,i (g)7r,2) = (/3i/?2)(aia2) - (/3ia2)(Qi/32) = 0. 
The concurrence of the phylogenetic state Ulljl is given by 
C{P{t)) = det[M,J det[M,J det[(;!>((5 ■ p)] 

and using the operator identity det[e^] — e*''^ can easily be computed 

C(P(t)) = e*'''^i*ie*''[^^*'poPi 

From this explicit form it can be seen that the concurrence is some kind of measure 
of phylogenetic divergence. Prom the concurrence we would like to construct a 
formal distance function. In the case of general L taxa it is of course possible to 
construct a reduced tensor which represents the pattern distribution upon any 
pair of the taxa. This is achieved using the prescription defined in equation (QJ. 
One can then go on to calculate the concurrence between any given pair of taxa 
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taken from the set of L. We define a distance function, dij, between any pair of 
taxa taken from a set of L as 



da : = 0. 



(15) 



From the definition of the concurrence Hl,'-i|l it is trivial to show that dij — dji. At 
the time, r, of the branching event at which the pair of taxa under consideration 
were created the concurrence took on the value poPi where 

P-y = -Paia2...ai_i7ai + i...aj_i7aj.|.i...aj:, (t), j'-^g-j 

all a's 

Of course we have < poPi < 1 and after this time the concurrence scales with 
the determinant det[Mi] det[Mj] which is also strictly positive and less than unity. 
We can conclude that the concurrence between a pair of taxa is always strictly 
positive and less than unity and that the distance function, dij, is also strictly 
positive. The triangle inequality 

dij + djk > dik 
is equivalent to the statement that 

which invoking 11411 can be expressed as 

-(ai+fSi+aj+f3j)t (ij) (ij) -(aj+f3j+c,k+Pk)t(3k) (jk) 

K Po Pi ^ Po Pi 

< „-(«i+/3i+afc+/3fc)t (ife)„(ife) 

i f= Po Pi ■ 



Here p^^^' is the single taxon marginal distribution existing at the node closest to 
the root which joins taxon i to taxon j. These marginal distributions are calcu- 
lated as in llfifl . Now depending on the branching structure of the tree we have 
_ pUk) pUk) _ p{ik) ^jj^^ ^j^g distance function satisfies the triangle 
inequality. 

T he dist ance function H15|l is well known in phylogenetics as the logdet dis- 
tance |22lfi] . 



10. The tangle 



We consider the case of L = 3 taxa derived from the branching of a single taxon 
at t = followed by arbitrary Markov evolution, an additional branching event 
on edge 1 or 2 at t = r and then additional arbitrary Markov evolution. For the 
case when the second branching event occurs on edge 2 the tree is represented by 
(1(23)) and the state is represented by a tensor in H = ® ® R as 



Prit) = [M,i (g) M,3 (g) M,J1 (g) S[l ig M,2]S ■ p, 



(17) 
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12 3 

Figure 3. The CTMC model of three taxa on the tree (1(23)). 

where ti — t, t2 = t, t-^ = t4 — t — r, (see figure H])^. 

It is known that there are 6 orbit classes of (g) ® under GL(C^) x 
GL(C^) X GL{C'^) that can be distinguished by fu nctions of the concurrence and 
another relative invariant known as the tangle |4l9j . We begin by defining a partial 

concurrence operation : (g) ® ^ (g) C^ a = 1, 2, 3.} as 

Cl(h) = hijkhlmn^jm^knei ® 6;, 

C2{h) = ^ hijkhlmn(-il(-knej (g) 6™, (18) 

From these definitions it is easy to see that 

Ci{h') : = Ciigi ® g2 ® gsh) 

= [det(c/2)det(p3)lffi ® giCi{h), 

with similar expressions for C2 and C3. The tangle, {T : g) g) C}, can 

be defined as 

T = CoCa, 

where we will confirm shortly that T is independent of the choice of a. The tangle 
is a relative invariant satisfying 

T{h') = [det(5i) det(<?2) Aet{g-i)fT{h), (19) 

and, in analogy to 1131 can also be written in the form 

The 6 orbit classes are described by the completely dis-entangled states 

h —u g) w g) ui, 
u,v,w £ ; 

the partially entangled states 

ha ■■ a =1,2, 3, 

^ The use of pattern frequencies for the case of three taxa has been studied in 
relation to the problem of tree reconstruction by Pearl and Tarsi |19| and Chang 
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which form 3 orbit classes characterized by the separability of the canonical tensors 



(1) hij}^ 


= UiVjk 


(2) hijk 


— UijVk 


(3) ftijfc 





the completely entangled states equivalent to the GHZ state 

hghz = -^{eo ® eo ® eo + ei ® ei ® ei); 
and the completely entangled states equivalent to the W state 

hn, = 7f (eo ® eo ® ei + eo ® ei (g) eo + ei (g) eo ® eo). 

The tangle and the concurrence and its partial counterparts can be used to fully 
distinguish these orbit classes. For the completely dis-entangled tensors we have 

Ca(/i)=0, Va; 

whereas for the partially entangled states we have 

Ca{K')=0, iff 5^=0. 

The GHZ and W orbits are distinguished by calculating the tangle 

T{hgh.) / 0, T{hw) = 0. 

From these properties we will now show that the tangle is indeed independent 
of the choice of the partial concurrence. Begin by introducing an action of the 
symmetric group, S3, on the tensor product space defined by 

(J e Ss : Vl ® V2 ® ^ V^l ® Va2 ® VaS.- 

Now since the tangle vanishes everywhere except on the GHZ orbit and crhghz = 
hghz we need only consider the value of the tangle on elements which lie on the 
GHZ orbit. We take as our element a; = (?i ® (72 ® gshghz and proceed. From its 
definition the tangle satisfies 

T{ah) = CoCaiah) = CoCaa{h), 

V/i G 7i. For our element x we have 

T{ax) = T{gal ® ga2 ® Qashghz), 

= [detgfT{hghz), 
= T(x). 

This shows that 

CoCa = CoCaa, 

Vcr e S3. 



We now determine which orbit the L = 3 phylogenetic state l|17|l lies in. The 
easiest way to do this is to calculate the various invariants at the time of the 
branching event. To this end we use Jsj so that components of the state are 



(21) 



Entanglement Invariants and Phylogenetic Branching 



15 



From this expression we might expect that the entanglement of the state given by 
the tangle just after branching can be expressed as a function of the entanglement 
in the state given by the concurrence just before branching. This is indeed the case. 
By direct computation using lEU it can be shown that at the time of branching 
the tangle is given by 

T(P.(r))=CoC3(P.(r)) 

= -2[C(M,, ®M,,5■p)]^ 

where = {ai,/3i;t2}. The tangle has the value 

T(P,(r)) = -2e-'("i+''i+"^+'^^'"[poPi]^ (22) 
Subsequent to this the tangle takes on the value 

T{Pr{t)) = (det[M,,,]det[M,3]det[M,J)'T(P,(r)) 

^ _2g-2("l+/3l+"3+/33+t»4+/34)(t-T)g-2{Qi+/3i+c«2+/32)T^p^p^^2 

SO that the phylogenetic state belongs to the GHZ orbit for all finite t. This equiv- 
alence has in fact been observed in a different context in M should be noted 
that as t ^ oo the tangle tends to zero and the state becomes the disentangled, 
stationary state which corresponds phylogenetically to the case where the taxa 
are unrelated. This is of course what we would expect if the taxa have diverged 
so much that there is no longer a possibility of establishing any relation between 
the taxa. 



11. The tangle and distance functions 



The tangle gives us a new tool for calculating the phylogenetic distance between 
a set of three taxa. As was the case with the concurrence it is possible to calculate 
the value of the tangle for any subset of three taxa taken from a set of L taxa. 
We use the tangle to define a three taxa phylogenetic distance given by 

dijk I log 2- ilog [~T {J2 Paia2...ar^ea, (g) Ca^ ® J ] . (23) 

For the case under consideration this three taxa distance takes on the value 

rfi23 = (oi + I3i)t + (aa + /?2)r 

+ (as + /?3 + Q!4 + /34)(i - t) - logpoPi- 

At this point it is illuminating to compare the values of all the partial distance 
functions 0(12,^23 and di3 with the value of the tangle for the case of the branching 
structure (1(23)). The distance functions take on the values 

rfi2 = (ai + /3i)t + {ct2 + P2)t + [ct'i + I3'i)(t - t) - logpopi, 

d23 = ("3 + ft + 04 + P4,){t -t) - logpop'i, 

di3 = (ai + I3i)t + (q2 + P2)t + [ai + I3i){t - r) - logpopi, 
where p = Mt^P- We define weights on the edges 1, 2, 3, 4 of the tree to be 

a; = (ofi + (3x)t, 
y = {a2 + I32)t, 
z= {a3 + I33){t~ t), 
W = {a4+ P4){t - t), 
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respectively. It is possible to solve the distance function equations for the weights 

X + y = di2 + drs - di23 + logpoPi, 

Z = dl23 — dl3, 
W = di23 - di2, 
logPopi = 2dl23 — dl2 - dl3 — d23- 

In summary we find that, if we assume the branching structure of the tree, we now 
have a prescription that gives us the evolutionary distances between three taxa 
up to errors caused by the fact we cannot determine the marginal distribution, 
p, which lies the at top node of the tree. This marginal distribution must be 
estimated using some resonable prescription. 

To elucidate the value of including the tangle in the analysis we present the 
corresponding set of branch lengths calculated without using the tangle 

x + y = |(di2 + di3 - ^23) - logpopi - I logpop'i, 
z = i(di2 + d23 - di3) + I logpopi, 
w = |(d23 + di3 - di2) + I logpopi. 

This comparison makes clear the advantage of including the tangle in the 
analysis of branch lengths. 



12. Conclusion 



We have shown that it is possible to present the continuous time Markov chain 
model of phylogenetics using a dynamical, tensor state space description. We have 
shown that the branching process introduces entanglement into the description, 
and that the group invariant approach to entanglement in quantum physics can be 
used in phylogenetics to derive distance functions between taxa. The main origi- 
nal result presented was the use of the tangle as a new distance measure between 
three taxa. 

Entanglement measures can be extended to the cases oi K > 2 and L > 3 
and will be explored in future work. In particular the inva riant th eory for the case 
of -ftT = 2, L = 4 is established in the physics literature '24F14J and progress in 
interpreting the theory in the phylogenetic context is underway. 

The use of invariant functions to distinguish alternate tree branching struc- 
tures was not achieved in this work, but future work will explore sharpening the 
group action from GL{V)'^^ to the more stringent Markov operator action and 
it is hoped that this will allow branching structures to be distinguished using the 
corresponding invariant functions. This will allow results establishing that if and 
only if the values of invariant functions taken on an arbitrary character distribu- 
tion satisfy certain relations, then it can be concluded that the distribution was 
generated from a Markov model on a given branching structure. 

There is the remaining issue that the distance functions are known only up 
to log functions of the marginal distributions at the nodes. An avenue of furthur 
research would be to determine under what conditions these log functions can 
be treated as statistically insignificant. This would involve studying how far the 
sequences can diverge before the information content of the distribution becomes 
so small that the calculation of edge weights is misleading. That is, under what 
circumstances are the edge weights large enough so that terms such as logpopi 
can be treated as small error? The problem is that the edge weights are a measure 
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of the magnitude of divergence, which if allowed to become large enough means 
that the information content of the distribution is small, and hence there is no 
potential to establish phylogenetic relation anyway. It would be fruitful to explore 
these issues analytically. 
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